Improved MO-LRT VAD based on bispectra Gaussian model
نویسندگان
چکیده
Introduction: Voice activity detection (VAD) remains a challenging problem in speech processing and affects a number of applications including noise reduction for digital hearing aid devices, speech recognition systems and speech coding for discontinuous speech transmission (DTX) in mobile and IP networks. During the last decade, researchers have paid attention to the study of discriminative features for classification and noise robust decision rules. A highly cited work is the VAD proposed by Sohn et al. [1], which is based on the evaluation of a single feature vector likelihood ratio test (LRT) and assumes a Gaussian model for the noisy signal DFT coefficients. The proposed algorithm considers a generalised LRT involving multiple and independent observations of the bispectra. The experimental analysis shows significant improvements over standardised and recently reported VAD methods.
منابع مشابه
Improved likelihood ratio test based voice activity detector applied to speech recognition
Nowadays, the accuracy of speech processing systems is strongly affected by acoustic noise. This is a serious obstacle regarding the demands of modern applications. Therefore, these systems often need a noise reduction algorithm working in combination with a precise voice activity detector (VAD). The computation needed to achieve denoising and speech detection must not exceed the limitations im...
متن کاملImproved Likelihood Ratio Test Detector Using a Jointly Gaussian Probability Distribution Function
Currently, the accuracy of speech processing systems is strongly affected by the acoustic noise. This is a serious obstacle to meet the demands of modern applications and therefore these systems often needs a noise reduction algorithm working in combination with a precise voice activity detector (VAD). This paper presents a new voice activity detector (VAD) for improving speech detection robust...
متن کاملCepstral domain voice activity detection for improved noise modeling in MMSE feature enhancement for ASR
In this paper we investigate the use of voice activity detection (VAD) for improving noise models used for cepstral domain minimum mean squared error (MMSE) filtering of noisy speech. Due to the popularity of MFCC features for speech recognition, it is useful to have VAD methods and MMSE filtering algorithms that both work in the MFCC domain. We propose a method for VAD based on the likelihood ...
متن کاملAn Efficient VAD Based on a Hang-Over Scheme and a Likelihood Ratio Test
The emerging applications of wireless speech communication are demanding increasing levels of performance in noise adverse environments together with the design of high response rate speech processing systems. This is a serious obstacle to meet the demands of modern applications and therefore these systems often needs a noise reduction algorithm working in combination with a precise voice activ...
متن کاملA uniformly most powerful test for statistical model-based voice activity detection
This paper presents a new voice activity detection (VAD) method using the Gaussian distribution and a uniformly most powerful (UMP) test. The UMP test is employed to derive the new decision rule based on likelihood ratio test (LRT). This method requires the Rayleigh distribution for the magnitude of the noisy spectral component and the adaptive threshold estimated from the noise statistics. Exp...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005